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^"T Consider a path of non-degenerate eigenstates lips), < s < 1, of unitary operators Us 

^— ^ or Hamiltonians Hg with minimum eigenvalue gap A. The eigenpath traversal problem is 

f^_^ to transform one or more copies of \ipo) into \ipi). Solutions to this problem have appli- 

I I cations ranging from quantum physics simulation to optimization. For Hamiltonians, the 

conventional way of doing this is by applying the adiabatic theorem. We give "digital" 
methods for performing the transformation that require no assumption on path continuity 
I— I or differentiability other than the absence of large jumps. Given sufficient information about 

O , eigenvalues and overlaps between states on the path, the transformation can be accomplished 

■y with complexity O {{L / A) \og{L / e)) , where L is the angular length of the path and e is a 

H 

Cu specified bound on the error of the output state. We show that the required information can 

O^ be obtained in a first set of transformations, whose complexity per state transformed has 
an additional factor that depends logarithmically on a maximum angular velocity along the 

^ path. This velocity is averaged over constant angular distances and does not require continu- 

fy\ ity. Our methods have substantially better behavior than conventional adiabatic algorithms, 

yr^ with fewer conditions on the path. They also improve on the previously best digital methods 

l/~s and demonstrate that path length and the gap are the primary parameters that determine 

^— s the complexity of state transformation along a path. 

> 

X 

U I. INTRODUCTION 

Most quantum algorithms exhibiting speedups are based on one of a small number of basic 
tools for quantum problem solving. These tools include phase estimation, which underlies quantum 
factoring, and amplitude amplification, which can be used to solve search problems. Another such 
tool is adiabatic state transformation (AST). Before finding applications in quantum algorithms. 
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AST was used in classical algorithms for quantum physics simulation [21 [121 I17j . It is now a 
key method for accessing low-energy states in proposed quantum simulations of physics, both 
in "digital" quantum algorithms based on the circuit model [53] and in "analog" simulation 
techniques involving direct realization of Hamiltonians in systems such as optical lattices (see 
Ref. [19;). The potential of AST was recognized in quantum computer science when it was proposed 
as a powerful heuristic for solving satisfiability problems jl^ [TT] . This led to the idea of adiabatic 
quantum computing (AQC), which involves encoding the output of any quantum algorithm in an 
adiabatically accessible ground state [H [11]. More recently, it has been shown that AST can be 
used for quantum speedups of Monte-Carlo algorithms [25\ 128] . 

We consider AST problems that involve transforming I'^o) into iV'i)) where these states are 
the endpoints of a path of states IV's) for < s < 1. The \ips) are eigenstates of operators Os, 
which may be unitary or Hermitian. For simplicity, we focus on the unitary case Os = Us- If 
Os is Hermitian, we define Us = e~**~^*. The eigenphase of IV's) with respect to Us is ips and is 
assumed to be non-degenerate. The gap to the nearest other eigenphase is denoted by A^ > A. 
We make no continuity assumptions on {ips) or Us- Thus, our formulation of the AST problem can 
accommodate cases where the path is discrete, parameterized by integers j = 0, . . . , n. It suffices 
to define \ips) = \ipj) for j/n < s < {j + ^)/n. 

The complexity of the AST problem depends on the available capabilities and what we know 
about the path. We assume that we can prepare copies of IV'o)) and that we can apply quantum- 
controlled instances of Us, both at unit cost. Additional information such as lower bounds on the 
gaps, eigenphase ranges and overlaps between the {ips) may be available. An important consequence 
of our results is that the main parameter that determines the complexity of an AST problem is 
the angular length of the path, which is defined as 



■v 



L = sup <! ^arccos(|(V's,#s,_i)|) 



= So < . . . < s„ = 1 



(1) 



If IV's) is differentiable in s, then L = J^ ||(1 — \ips) {ips\)\ds^s)\\ds- For the purpose of making com- 
plexity statements, let L = max(7r/2,L). Given sufficient information about the overlaps between 
nearby states on the path, we show that the complexity of an AST problem is 0{{L/A) log(Z/e)), 
where e is a specified bound on the error with which \^i) is prepared. (Our complexity statements 
hide constants that apply uniformly for allL>0,0<A<7r and < e < 1.) The dependence 
on L and A is within a factor of at most log(L) of optimal. That is, there are classes of AST 
problems for which every algorithm uses r2(L/A) applications of the Us [8]. With less information, 



we provide algorithms whose complexities have an additional factor that depends on a maximum 
locally averaged angular velocity 

t'max = sup |L(si, S2)/(S1 - S2) < Si < S2 < 1, -L(si, S2) > 0\ , (2) 

where -L(si,S2) is the angular length of the path restricted to [si,S2]; and is a constant. In 
order for Umax to be finite, the path must not have jumps of angular distance 9 or more. The 
average velocity over the whole path is Uavg = L/1. If the set in the definition of Umax is empty, 
set Umax = ^avg- We find that, in general, the transformation can be accomplished with complexity 
0((£/A) log(£/e)) per copy of \tpi) produced, where C = -^(log(umax/i'avg) + 2). Implicit in this 
bound is an extra factor of log(log(umax/i'avg) + 2). It is required only when it is necessary to 
transform multiple copies at the same time, which is the case when sufficiently small ranges for the 
eigenphases are not yet known. The error bound e then applies to the state of all copies together, 
not each copy separately. All our complexities can be refined if the gap varies over the path in 
a known way, allowing faster traversals of parts of the path where the gap is large. We provide 
general tools to analyze this situation. 

Many problems require the production of a large number of copies of |^i), for example to obtain 
good precision on expectations of observables or values of correlation functions. In these cases, the 
first set of transformations can be used to get the overlap information needed to optimize future 
transformations. Thus, except for a first set of transformations, the complexity does not depend 

on Umax- 

We can compare the complexities obtained here to those of other techniques for solving AST 
problems. In the case where the operators Os are Hamiltonians, Og = H(s), the best known 
technique is based on the adiabatic approximation and involves evolving under H{s), changing s 
slowly enough to ensure the desired transformation. This technique works well in analog approaches 
to solving physics simulation problems. Analyses of the adiabatic approximation [H |3l HI El [9l [TSl 
[T6| [20l [23] show that the total evolution time required to obtain |'0i) with error bounded by e 
satisfies t £ O (supg [\\dsH{s)\\'^/A^ + \\d'^H{s)\\/A'^) /e), where || • || is the operator norm. This 
assumes sufficient differentiability of H{s). The dependence on e is much better if H{s) is highly 
differentiable and turned on/off slowly at the beginning and end of the path. The path length 
satisfies L < su]ig\\dsH{s)\\/A. Examples can be constructed where L = Q,{supg\\dsH{s)\\/A), 
so the bound on r has a term that may approach I? j A. Besides requiring differentiability, this 
complexity has at least an extra factor of L. In the absence of differentiability, it is possible to 
randomize the evolution under ^(s) for an average complexity of r = 0(Z^/(Ae)) [71 [23] given a 



discretization of the path with sufficiently uniform angular step sizes. This technique can also be 
applied in an analog setting. In general, the factor of 1/e in the complexities can be improved to 
log(l/e) if we can verify {'ipi) by checking its eigenvalue with respect to H{1). 

The randomized methods in Ref. [^ are also applicable to paths of unitaries. The paths must 
have a discretization with asymptotically small angular distances between adjacent states, implying 
a continuous underlying path. A method with better complexity is in Ref. [2B]. It is based 
on Grover's fixed point search and if applied to our formulation of AST has a complexity of 
C'(Llog(L/e)^/A), given an appropriate discretization of the path. This method was not analyzed 
for arbitrary paths, but a discretization with sufficiently large overlaps between successive states on 
the path works, so continuity is not required. Our work improves on this complexity, and perhaps 
more importantly, shows that good complexity can be obtained even when overlaps between states 
on the path and eigenphase ranges are unknown. It suffices to have lower bounds on the gaps, 
and in the least informed case, be assured that a certain eigenphase dominance condition (to be 
defined below) applies. 

The most salient complexities are summarized in Table |I} It is worth noting that many ap- 
plications of AST to search and optimization problems satisfy that L = 0(1), in which case the 
complexity is dominated by 1/A with additional logarithmic factors in 1/e and fmax/^avg- For 
example, this holds in the direct application to Grover's search problem, where H{s) linearly in- 
terpolates between projectors onto the initial and final states, respectively. In this case we have 
sufficient knowledge of the eigenphases. If we apply our methods without using knowledge of the 
local behavior of the gap or the rate of change of the states, then our complexities have the ex- 
pected quadratic speedup over classical search except for a logarithmic factor due to the speedup 
of the path near where the gap is minimal. However, in this case we are lucky: The maximum 
speed is attained in the exact middle of the path, which is at an angular distance of almost 7r/4 
from either end. Because the algorithms use equal subdivision to recursively implement the state 
transformation (Sect.Q, the transformation succeeds more quickly than expected, and the extra 
logarithmic factor is dropped. For applications to search and optimization there is no reason to 
transform more than one copy. 

After we outline the conventions used in this paper, we introduce a number of basic oracles that 
encapsulate the elementary operations that we need to perform the state transformations. The 
oracles are de&ied to be error-free. Their actual implementations in terms of the Ug are based on 



standard phase-estimation techniques with well understood error behavior. In Sect. IV we develop 



several procedures for transforming an input state {ip) into \(j)) in one step. The procedures depend 



on how much is known about the two states and their overlap. In Sect. |V] we compose these steps 
for transformations along a path. When overlaps are not known, this requires an analysis of a 
recursively defined tree of intervals, which we perform in sufficient detail to enable cost estimates 
that are sensitive to variations in the gap A^ along the path. Most of our algorithms are described 
with components whose number of steps is random and the primary complexity given is the average 
number of steps. To ensure that reversible versions of our algorithms can be constructed with no 
change in complexity, we keep track of the tail behavior of the number of steps, which always has an 
exponential decay. The reversible implementations have a built-in deterministic stopping criterion 



that is a multiple of the average. Sect. VI summarizes the complexities of our state transformation 
algorithms in a table and considers how our results can be generalized to the situation where the 
eigenphases are degenerate, and the goal is to transform a state in an eigenspace of Uq into some 
unspecified state in a corresponding eigenspace of Ui. 

II. CONVENTIONS 

Kets are normalized states unless explicitly stated otherwise. When writing states such as \f) 
for real numbers ip, we assume that \ip) are orthonormal states for distinct ip. The numbers ip are 
to be expressed in terms of labels of computational basis states for a finite system in a reasonable 
way. For instance, if could be written in binary, with the digits corresponding to basis states of 
qubits. The precision used should be appropriate for the context. 

When writing linear expressions involving eigenphases, we always intend them to be valid mod- 
ulo 2tt. For example, when constraining an eigenphase ip hy if G [ipo — 5,ipo + 5], it is intended to 
be read with the expression "mod(27r)" appended. 

For a number of parameters (such as A, L, eigenphases and error bounds), we require that they 
have "reasonable" values. For example, eigenphase gaps should be less than vr and error bounds 
less than 1. We normally take such constraints for granted without specifying them explicitly. In 
most cases, it suffices to replace the parameter by a nearby sensible value if the parameter is out 
of range. 

To transform states along an eigenpath of a path of unitary operators, we ultimately use con- 
trolled forms of these unitary operators and their inverses. However, we initially provide algorithms 
calling on idealized operators defined in terms of the unitary operators and their eigenstates of in- 
terest. We refer to these operators as oracles. When analyzing the complexity of algorithms, we 
do not distinguish between controlled and uncontrolled applications of unitary operators or oracles 



and their inverses. In general, we do not explicitly mention the adjective "controlled" or the term 
"inverse" , leaving them implied. 

When specifying the behavior of an oracle or subroutine, we use the expression "combination of 
states" to refer to any superposition and/or mixture of the given states. Formally, a combination 
of the states |tj) is a state of the form ^^ \ti)\ei)E, where |ej) are unnormalized states of E and E 
is a system independent of previously introduced ones. When we say that T is an operator that 
transforms {ip) into a combination of the states \t^^i), we mean that T(|^)) = ^^ \t'ti;,i)\&'4>,i)E, where 
-E is a system introduced by T and |e^^j) are unnormalized states of E. The total state associated 
with the combination is normalized. We always define such operators so that orthogonal states are 
transformed into distinguishable combinations, and require T to act isometrically. The implicitly 
introduced system E is different for each instance of T in an algorithm. The particulars of the 
combination may also vary with instance. Thus, if an operator or oracle T has been defined in terms 
of combinations of states for a family of input states, the symbol T refers to an arbitrary operator 
satisfying the definition each time it is used. Formally, one can achieve this effect by transforming 
expressions as follows: For each occurrence of T replace it with an occurrence-specific new symbol 
T' and prefix the expression with "for some operator T' satisfying the definition of T" . 

We intend T to be reversible. In the absence of true decoherence processes, this can always be 
achieved. We may use semi-classical language to describe various computational actions such as 
setting a newly introduced register to a particular state, but implicitly rely on such actions having 
reversible forms. If the reverse of T immediately follows T, the system E is effectively eliminated 
by being returned to an initial state. If we performed some other action before reversing T, E may 
play a decohering role. Note that since E is implicit in the definition of T, it is not accessible to 
actions not involving T. If reversals are used, which instance of T is reversed needs to be stated 
if it is not clear from context. When reversals are not used, the statement that T maps {ip) to a 
combination of states |t^,i) is equivalent to the statement that T is a quantum operation satisfying 
that the support of T(|^)(V'|) is in the span of the \t^^i). Besides allowing for reversals of instances 
of T, defining combinations in terms of implicit systems enables amplitude-based error bounds. 

We may want to compare T to an implementation W. Suppose T has been defined on input 
states {ip) as above. We say that the error amplitude of W with respect to T is a if M^ transforms 
the input states into the specified combinations up to a term of absolute amplitude at most a. (We 
omit the adjective "absolute" if it is clear from context.) That is, M^(|^)) = J2i \'t^i),i)\fil>,i)F + kv)' 
where || \r^) \\ < a. Note that the error amplitude is defined as an upper bound, not an exact error or 
distance. We use the fact that error amplitudes are subadditive under composition of in-principle 



reversible processes. Specifically, this holds whenever the processes can be realized unitarily by 
addition of ancillary systems. This may require replacing explicit measurements by steps that 
reversibly record the measurement outcome in the ancillary systems, and performing steps that 
are conditioned on previous measurement outcomes by the appropriate quantum-controlled steps. 
After this change, subadditivity of the error amplitudes follows by writing the final state as a 
combination of the error free state and the errors introduced by each step propagated through the 
subsequent ones. Observe that subsequent steps do not change the amplitude of the propagated 
error. 

Most of the procedures that we analyze are not explicitly formulated in reversible form, and 
the primary complexity measure is an average cost. Because state transformation procedures have 
applications as subroutines in larger quantum algorithms, it is desirable to have reversible versions 
that can exploit quantum parallelism without introducing unwanted decoherence. However, the 
average cost is determined with respect to stopping criteria associated with measurements. When 
the procedure is reversified, one cannot have a stopping criterion that is input dependent. In 
principle, this may require running the procedure much longer than suggested by the average 
cost to ensure that all possible computation paths terminate. Suppose the average cost is C. 
If we allow for some error amplitude, we can set an absolute termination criterion by stopping 
when the total cost has exceeded Cmax- This can be done in a reversible way and introduces 
an error amplitude bounded by y C/Cmax (Markov's inequality for non-negative random variables 
converted to amplitude). This bound is undesirably large and, without additional assumptions, can 
be approached. So we seek procedures where the probability distribution of C decays exponentially 
after some multiple of C. If Prob(C > c) < x'^~^'-' for some < x < 1 and A > 1, then the error 
amplitude for Cmax > AC is bounded by a;{Cmax-AC)/2^ This implies that for an error amplitude 
of e, we can set Cmax = AC -|- 21n(l/e)/ln(l/a;), so that the error dependence of the cost has an 
additive term that is only logarithmic in e. 

In order to keep track of the exponential decay of costs, we use the large-deviation technique of 
bounding the expectation (F ) of T . Thus, whenever it matters, we specify the tail behavior of 
the probability distribution of C by an inequality of the form (T*-" ) < T^ for 1 < F < Fmax- Rather 
than trying to optimize the inequality, we generally choose convenient, simple expressions for Fmax 
and C, ensuring that C is bounded by a constant multiple of the average cost. This suffices for 
stating bounds on complexities while having reasonable estimates for the hidden constants. Given 
such a bound, we can use Markov's inequality to show that Prob(C > c) = Prob(F'^ > F'^) < F*^"*^. 
This is of the desired form, with x = 1/F. When proving tail bounds, we liberally use the fact that 



F{X) = (e ) is log-convex in A. In particular, if (T^) < Fj^ for some Fi > 1, then this inequality 
automatically holds for all F between 1 and Fi. 

The main reason to use the large-deviation technique of the previous paragraph is to simplify 
the estimation of tail bounds for total costs of compositions of procedures. For this purpose we 
have the following lemmas: 

Lemma II. 1. Consider a sequence of procedures Sj with costs Cj satisfying (F ■'|Ci, . . . ,Cj^i) < 
F^^ for l<r< F^ax. Define Ctot,i = ELi C^ ^^d Act,/ = E!=i A- Then (F^-,) < r^tot, for 
1 < r < F^ax. 

For random variables A and B, the expression {A\B) used in the lemma denotes the conditional 
expectation of A given B. 

Proof. The proof is by induction on I using a standard large-deviations approach. Let fi denote 
the measure for the probability distribution of its arguments. 

^pC^toM+i) = /'(F^'+i|Ci,C2,...C/)F^*°*>'d/x(C7i,...,Q) 

< r^i+i /"F^t°'.'d^(C7i,...,C0 

< pC';+ipC'tot,! 

-- pC'tot,;+i 

D 



Lemma ILl implies that if each component procedure has exponentially decaying cost above a 
multiple of the average, so does the composition. The next lemma generalizes this result to the 
case where the number of Sj invoked is not deterministic. In the lemma's statement, the binary 
random variable Wj can be thought of as "Sj was successful" . The lemma is intended to be applied 
when Sk depends only on which of the Sj with j < k where successful. For a sequence of random 
variables Xi, we write Xj = {Xi, . . . ,Xi). 

Lemma II. 2. Consider a sequence of procedures Sj with costs Cj. Let Wj be a binary random 
variable such that Cj and Wj are conditionally independent of Cj-i given Wj_i. Let Vj = 1 if 
Cj > and Vj = otherwise. Define m = Y2j ^j ^^'^ suppose that (A*") < A™ for 1 < A < Amax 
and (F"^^ \^j,Vj) < F^ /or 1 < F < F^ax- Then (T'^tot,^) < F™'^ /or 1 < F < min (aU^, F^ 

The proof is given in Appendix [K\ 
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III. ORACLES 

We introduce four oracles implementing idealized quantum operations. We can implement 
versions of these oracles with low error amplitude in terms of subroutines with full access to the 
unitary operators defining the eigenpath. 

We say that an oracle is [/-controlled if it commutes with any X that commutes with U and 
acts on the same system as U . Such an oracle necessarily preserves eigenstates IV') of C/ in the 
sense that it transforms |'0)|q^) to a state of the form \ip)\f{a,tp)) for any state \a) of other systems. 
In particular, if an instance of the oracle is reversed on a state of the form |V')|&) where \ip) is an 
eigenstate of U and \b) is arbitrary, the result is of the form \ip)\a'). A strictly [/-controlled oracle 
is obtained if it is implemented solely in terms of controlled-[/ operations. The implementations 
described for oracles below that are required to be [/-controlled satisfy this property. 

Our oracle implementations have a specified error amplitude but are still strictly [/-controlled 
when this is required. It is worth noting that for strictly [/-controlled implementations, if a bound 
on the error amplitude is specified only for eigenstates of [/, then it holds in general. To see that 
it holds for arbitrary superpositions of eigenstates, it is necessary to use the fact that the error 
amplitudes for orthogonal eigenstates are orthogonal. This holds because of being [/-controlled: 
For eigenstate inputs, the decomposition of the output state into an error-free part and the error 
amplitude results in both being a product of the eigenstate with states of other systems. We note 
that in general, given error amplitudes only for a basis of the state space, errors in a superposition 
can add as amplitudes rather than probabilities. This may result in a \d error enhancement, 
where d is the dimension. 

Definition III.l. Given a unitary operator U and resolution 5, a phase estimation oracle PE([7, 5) 
is an isometry that transforms eigenstates \ip) ofU with eigenphase if into a combination of states 
of the form 

m^A , (3) 

where (px — (f € [—6,5]. We require that PE is U -controlled. We can implement PE with error 
amplitude e using O {\og{l / e) / 6) applications of U . 

To implement PE it suffices to use a high-confidence version of phase estimation. Such a phase 
estimation technique and its analysis are in Ref. [TS]. PE is [/-controlled because all actions 
involving [/'s system are ancilla-controlled U operations. 
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Definition III. 2. Given a unitary operator U , a phase ipQ and a resolution 5, a phase detection 
oracle PD([/, c/?o,(5) is an isometry that acts on eigenstates {ip) of U with eigenphase (p G [939 ~ 
(5/4, ipQ + (5/4] or ip^ [ifo - 3(5/4, ip^ + 3(5/4] as 

PD(C/,v9o,<5)|^) = |V')|&)^, (4) 

where b = 1 if if £ [(po — (5/4, (^0 + ^/M (^^d 6 = otherwise. Eigenstates \ip) not satisfying 
the eigenphase constraint are mapped to arbitrary combinations of states of the form 1-0)16)^. We 
require that PD is U -controlled. We can implement PD with error amplitude e using 0{\og{l/e)/6) 
applications of U . 

To implement the phase detection oracle with the state(i complexity, apply PE(C/, (5/4), labeling 
its output register A' . Reversibly set register ^ to |1) if ipx G [po — (5/2, pQ + (5/2], otherwise set it 
to |0). Then reverse the instance of PE used. 

Definition III. 3. Given a unitary operator U , a phase ipo and a resolution 5, a reflection ora- 
cle R(f7, c/5o,(5) is an isometry that acts on \^) in the subspace spanned by eigenstates of U with 
eigenphase (p G [(/Jq — <5/4, pq + (5/4] or p ^ [pQ — 3(5/4, pQ + 3(5/4] as 

R{U,po,5m = {-lf\^l:) , (5) 

where b = 1 if p £ [po — 5/4, (/?o + (5/4] and 6 = otherwise. Eigenstates \ip) not satisfying the 
eigenphase constraint are mapped to a combination o/|^). We require that R is U -controlled. We 
can implement R with error amplitude e using C'(log(l/e)/(5) applications ofU. 

To implement the reflection oracle with the stated complexity, apply PD(C/, po, 6). Conditionally 
on the bit in register A change the phase of the input state. Then reverse the instance of PD used. 

A reflection oracle preserves eigenstates but can decohere the phases for eigenstates not sat- 
isfying the constraint by correlating them with the instance-dependent system implied by our 
convention for combinations. 

We note that the implementation of the reflection oracle is a special case of the functional 
calculus of U implemented via phase estimation. For functions / preserving the unit circle in the 
complex plane and slowly varying except in known eigenphase gaps, one can implement f{U) by 
using phase estimation with sufficient resolution, changing the phase according to / and reversing 
the phase estimation. The error can be bounded in terms of the resolution used. A version of this 
observation for more general / (which requires postselection) but not using high-confidence phase 
estimation can be found in Ref. Il4 
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Definition III. 4. Given states \^), \(j)), an overlap threshold a and a resolution 5, an overlap 
detection oracle OV(V', </>, a, (5) is an isometry that transforms \il:) into a combination of states of 
the form 

m\b), , (6) 

where the following holds: /f arccos(|('i/'|0)|) < a — 5, then b = 1 and {ip') = 1-0). //arccos(|('(/'|i?i>)|) > 
a + 5, then 6 = and {ip') = {tp). Otherwise, {ip') is in the subspace spanned by {tp) and \(p). We 
can implement 0\(tp, (p, a, 5) with error amplitude e using Oi\og{\/e)/5) applications of reflections 
around \%p) and \(p). 

Let R^j} and i?^ be reflections around \ip) and \(p), respectively. The overlap oracle is implemented 
by applying the phase estimation oracle PE(C/ = R^R^, 26). The operator U preserves the subspace 
spanned by \tp) and \cp), and its two eigenvalues on this subspace are e^^'^^^'^'^^^Uwrnl) _ "vyg gg^ 
register j4 to |1) if the phase ipx returned by the phase estimation oracle satisfies ipx G [—2a, 2a], 
and to |0) otherwise. We then reverse the instance of PE used. See Ref. [18j for more details. 

If the reflections required for the overlap oracle are implemented in terms of reflection oracles 
with resolution 6' , the overall complexity for an error amplitude of e has a factor of log(l/e)^. We 
generally aim for a dependence on e of log(l/e), which requires bypassing direct uses of overlap 
oracles. 

IV. ONE-STEP STATE TRANSFORMATIONS 

Suppose we are given a system in the eigenstate {ip) of C/ and we wish to transform \tp) into the 
eigenstate \(p) of V. We denote the eigenphases by fu and ipv, respectively. We assume that the 
eigenstates are unique for the eigenphases and that the gaps to the nearest other eigenphases are 
bounded below by A. The overlap probability is denoted hy p = \{'ip\(p)\'^. Define q = {I — p)- The 
methods for transforming the states depend on what is known about the eigenphases, gaps and 
overlaps. Given reflection oracles and p bounded away from and 1, we can use ideas developed 
for fixed point quantum search [27j and QMA amplification |2H [22] for a transformation using a 
constant number of reflections on average. Define reflection operators by R^ = 1 — 2\%p){'tjj\ and 
R(f, = 1 — 2\(p){(p\. Each reflection operator can be implemented with one call to the appropriate 
reflection oracle. The transformation from \'tjj) into \(pi) is accomplished by repeatedly applying 
the circuit RT(0,0) of Fig. [I] until the measurement outcome is 1, indicating that \(p) has been 
prepared. 
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|o) —[h] — f \WY^^ = 

— Rif} — Rcf, 

FIG. 1: Quantum circuit Kr{ijj, (/>) for a state transformation attempt. H denotes the Hadamard gate. The 
fiUed circle denotes control on the state |1) of the corresponding ancilla qubit. 

The effect of KT{ip,4>) is to apply a reflection around 1-0) followed by a projection onto |0) if 
the measurement outcome is 1, or a projection onto the orthogonal complement otherwise. The 
subspace spanned by |V') and |(/)) is preserved by the process. Let \(j)-^) be the state orthogonal to \(j)) 
in this subspace. If the subspace is one-dimensional, choose any orthogonal state. The procedure 
for transforming the states can be analyzed as a Markov chain on the states \ip), \(p) and !</'"'"). We 
consider a slightly more general procedure T(^,(l)) that can be applied to any initial state {ip') in 
the subspace spanned by \ip) and {(p). Define po = \{ip'\(p)\'^. The first step of the procedure consists 
of the circuit for I{T{ip,(j)) with the reflection around \^) omitted. Next, m^{'ip,(p) is applied until 
the measurement outcome on the ancilla qubit is 1, indicating that {(p) has been prepared. The 
transition probabilities for this procedure are shown in Fig. [2] 

In general, when we describe a step of a procedure as a measurement of a state \(j)'), this 
is intended to be implemented by means of a controlled reflection around \(/)') as in the second 
part of RT. The effect is a projection onto \(f)') or the orthogonal complement, depending on the 
measurement outcome. 

To simplify the notation, we omit arguments of procedures such as T and RT when they are 
sufficiently clear from context. The arguments are typically passed on to appropriate oracle calls. 
We may therefore use any set of alternative arguments that are sufficient for specifying these 
oracles. 

Lemma IV. 1. We can transform any state \^p') in the subspace spanned by \tp) and \(f>) into \(f>) 
with a procedure T using (n) = pQ + (1 — po)(l + 1/(2m)) ^ 1 + ^/C^PQ) reflections around the 
states on average. For 1 < F < l/\p — q\, define c by the equation F^ = (1 — cpq)/{p — q)"^ . We 
then have (F") = T{pQ + {1-pq)AT^/c) < AF^/c. 

Proof. The procedure is as described above. Consider the process of Fig. [2] Define n' to be the 
number of reflections used after the first step, if the first step resulted in state \4>'^). We have 
(n) = pq + {1 — po){l + {n')). If the first step failed, the second step can either succeed or return 
to If/*"*"). In the first case, we used n' = 2 reflections. In the second case, the expected number of 
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1-po 




FIG. 2: State diagram for T{tp, (p). The state lip') is the initial state and po is the overlap probability of \ip') 
with \ip). For the non-trivial case of p < 1, the transition probabilities from \(j)'^) are obtained by explicit 
computation of the reflections in the two dimensional subspace of the states. For this purpose, one can write 
W = Vp\^) + ^/g|0^), 1^^) = ^910) - Vp\^^) and |0^) = ^H) - Vp|V^>. 

reflections yet to be used is again {n'). This implies the equation (n') = Apq -2 + {p — q)'^{2 + (n')). 
Thus (n') = 2/{4pq) and (n) = po + {I - po){l + 1/(2^)). 

Similarly, we can calculate (F") by solving the equations (F") = poT + (1 — pq)T{T'^ ) and 
(F"') = 4pgF2 + {p- g)2(F2(r"')). This gives (F"') = 4pqr^/{l - {p - q)'^r'^) = 4T'^/c. The last 
inequality follows because c < 4. D 



According to Lemma IV. 1 , the number of reflections used satisfies an exponential decay with a 
fixed base less than 1 provided that p is bounded away from both and 1. In order to obtain a 
better behaved transformation that only requires p to be bounded away from 0, we use an overlap- 
suppression trick to modify T. We first add an ancilla A in the state y/3/4\0)A + y^l/4|l)^. Write 
1^) = \iP){^/^\0)A + y^l/4\l)^) and \4>) = |(/>)|0)^. Reflections around |V') and \4>) in the extended 
statespace can be implemented with properly controlled reflections around \tp) and |0). We can 
therefore transform \ip) into {(J)) using T(^, (p) After we are done, we discard the ancilla to extract 
the desired state \(f)). The overlap-suppression trick leads to the following lemma: 

Lemma IV. 2. If p > 1/3, we can transform {ip) into \(j)) with a procedure T^ whose average 
number of reflections satisfies (n) < 4. For 1 < F < 7/4, we have (F") < F^. 

Proof. With the technique just described, the overlap probability is changed from p to 3p/4. Given 



the assumed lower bound, this ranges from 1/4 to 3/4. Here, po = 3p/4. From Lemma IV. 1 the 
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average number of reflections used is at most 1 + l/(2(l/4)(3/4)) < 4. 

To prove the bound on (F"), it suffices to show that for F < 7/4, F^ > 4/c and apply the last 



inequality of Lemma IV. 1 The function {p — q)^/{pq) is symmetric in p and q and achieves its 
maximum of 1/2 for the allowed values oi p at p = 1/3. Using the inequality (1 + x)^ > 1 + yx for 
y > and x > —1, we get 

r3 > r^;^ = (1 + (f2 - 1))^^ > 1 + SiLJlLfY^ _ n. 

pq 

Since F^ = {I — cpq) / {p — q)'^ = I + {A — c)pq/ {p — q)'^ , we conclude that F^ > 5 — c. The constraints 
on F and p imply that c > 1, so F'^ > 4 > 4/c. D 



The constants used for Lemma IV. 2 have been chosen for convenience and have not been opti- 
mized. Changing the lower bound on p or the parameters of the ancilla state when redefining \ip) 
changes the bounds in the lemma but does not affect the complexities to be derived later. 

To deal with the problem of low overlap probability p, we extend Tm to a procedure Tx with 
the property that the initial state |'0) is transformed Up is large enough and unchanged for p too 
small. 

Lemma IV. 3. We can implement a procedure Tx that transforms \tp) into a combination of states 
of the form |^)|0)^ and \(j))\l)^, where register A is \1) if p > 1/2 and |0) if p < 1/3. Tx requires 
one overlap oracle call with a resolution of (arccos( 1^1/3) — arccos(y^l/2))/2, a reflection around 
\tp) , and, in the case where the state is transformed into {(p), one instance ofTm with p guaranteed 
to be at least 1/3. 

Proof. To implement Tx, we use the overlap oracle 0V(^, (j), (arccos( y^1/3) + 
arccos(Y^l/2))/2, (arccos(Y^l/3) — arccos(Y^l/2))/2) once to obtain a combination of \ip')\b)^ with 
\ip') in the span of \ip) and |0). We then use a reffection around {ip) to measure {ip). If the state 
is determined to be \ip'^) or if 6 = 1, we apply T„i- The properties of the overlap oracle and the 
choice of parameters ensure that this happens only if p > 1/3. It always happens if p > 1/2. D 

The procedure Tx provides bounds on the overlap at the cost of calling an overlap oracle. As 
noted in Sect. |III[ this results in unwanted overhead when accounting for error amplitudes. This 
can be avoided if one does not seek guaranteed overlap bounds and accepts the possibility that 
even at high overlap, the transformation may not succeed. 



Lemma IV. 4. We can implement a procedure T^ that transforms \ip) into a combination of states 

i-p. 



of the form \ip)\0)^ and |0)|1)^, where register A is \1) with probability 1 — jr^iq — p)'^, and the 
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number of reflections around I?/;) and \(f)) is bounded by n < 5 + n' , where {n') < 1/(1 — p) and 
(r"') < (1 - p)T/{l - pT) forl<T < l/p. 

Register A's contents indicate whether the transformation succeeded. 

Proof. To implement T^, we apply a |(/)) -measurement followed by at most two applications of RT, 
stopping if 10) is detected in the measurement. If after this, the state is \(j)-^) (as indicated by 
the last measurement), we alternately make \^)- and |(/)) -measurements until either \ip) or \(p) is 
detected. Register A is set to |1) if \(j)) was detected in the last measurement made. The probability 
that register A is |1) is at least the probability that the first three steps of the process of Fig. [2] 



terminate at |(/)), which is 1 — (1 — p){q — p) (see the proof of Lemma IV. 1). We can improve this 
by noting that conditional on the failure of these steps, the probability of success in the sequence 
of alternating measurements is (1 — p)pYl'k=P^^ ~ P/i^ + P)- Multiplying by (1 — p){q — pY and 
adding io 1 — [1 — p){q — p)^ we get the overall probability of successful preparation of \4>). 

The first part of the procedure uses at most five reflections. The probability of failure to produce 
an acceptable outcome in a given measurement in the second part is p. Thus, if the first part fails, 
the expected number of refiections used in the second part is 1/(1 — p). Setting n' to the number 
of reflections used if the first part fails (event -F), we have (F" \F) = (1 — p)T + pr(r" \F). To 
obtain the last statement of the lemma, it suffices to solve this equation. D 

For later use, we note the following refinement of the lemma: 

Corollary IV. 5. When T^ is invoked on state {tp), the distribution of the number of reflections 
used by T^ and whether T^ succeeds is independent of any previous events. We also have (F" \A) < 
(1 — |?^)F^/(1 — F-^p^) for 1 < F < l/p, where A indicates success or failure of T'^. 

Proof. Let S^ and E^ be the events that the first part of T^. fails and \(f)) or 1-0) are eventually 
obtained, respectively. The probability of Efj, is p/(l + p). Conditional on E^, n' is even, n' > 2, 
and we get 

^ n'e{2,4,...} ^ 

Similarly, conditional on E^, n' is odd, and (F" \E^) = F(l — p^)/(l — F^p^). The claim follows 
because F > 1. D 

For concreteness, we give the following corollary: 
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Corollary IV. 6. For p > 1/4, T^ transforms into \(j))\l)^ with probability at least 19/20. For 
P < 3/4, the average number of reflections satisfies (n) < 9, and (r"|A) < T^^ for 1 < F < 8/7, 
where A indicates success or failure of T'^ . 

Proof. We use the lower bound 1 — {1 — p){q — p)^ for the probability of success. The minimum 
of 1 — {^ — p){q — p)^ for p G [1/4, 1] is at p = 9/10, and one can check that the value is above 
19/20. Since F > 1, f{p) = (1 — p^)/(l — p^F^) achieves its maximum on p E [0, 3/4] aX p = 3/4. 
We show the last bound for F = 8/7, which is sufficient by log-convexity. In this case F^ > /(3/4) 



and, since n < 5 + n'. Cor. IV. 5 gives the bound. D 



We let T^^ be the procedure obtained by combining the overlap-suppression trick with T'^. The 
procedures T, T^, Tx, T^ and T^^ can be implemented in terms of the reflection oracles R when 
the states \tp) and \(j)) are specified to be eigenstates of U and V with isolated eigenphases known 
to satisfy (fu S [(fu — A/4, ipu + A/4] and c^y G [(fy — A/4, tpy + A/4] , and with gaps lower 
bounded by A. Here (pu and (pv are assumed to be known, but not fu and fy In this situation, 
we say that we know A/2-ranges for the eigenvalues. The instances of the reflection oracles used 
by the procedures are given by R(f/, (fui A) and R{V, (fy, A). We choose the third parameter to be 
as large as possible, because this decreases the complexity of implementing the reflection oracles 
in terms of the unitary operators. For the rest of this paper, whenever we use one of T, T^ and 
Tx, we assume that the reflections used are based on reflection oracles R with the third argument 
given by a known lower bound on the minimum gap, by default A. 

We are interested in the situation where the overlap probability p is bounded away from 0, 
but we do not know a sufficiently small range for the eigenphase of \(p) with respect to its defining 
operator V to use the requisite reflection oracle. We show that a sufficiently small eigenphase range 
can be obtained with low error if we transform many copies of \ip) in parallel into |0), provided 
p is sufficiently large. The statement of the lemma includes an optional projection Ho such that 
no|</>) = 0. This is later used in the context of the overlap-suppression trick so the eigenvalue of 



the modified unitary does not get confused with the original phases (for instance, see Lemma IV.8 ). 

Lemma IV. 7. Suppose that p > 1/2 + 7 with 7 > 0, and V = Vq (B Ho, where Hq is a projector 
that we can use to control other operations and no|(/>) = 0. Then, using 2r instances of phase 
estimation oracles PE(y, A/5), we can implement an isometry ER(y, IIq, A,7) with the following 
property up to an error amplitude of e~'''^' : ER transforms l^)®*" into a combination of states of 
the form 

|^)0.|^±)0(-.)|^)^|^-)^ , (7) 
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where j > r/2 and if — ipy G [—A/5, A/5]. 

Proof. Use system label i for the i'th copy of \ip). We can modify the phase estimation oracles 
PE(y, A/5) so that the eigenphase register contains a non-eigenphase # if the input state is in 
the support of Hq. We implement ER by first using these modified phase estimation oracles 
PE' independently on each \ip\, placing the eigenphase in register Ai. We can express \ip) as 
IV') = ^/p\4') + \/l ~ P\'i^'^)- By definition, the i'th instance of PE' acts as \(f))i i— )• \(t))i\wi)j^,^, for 
some |w^j)a B ^^"^ some instance-specific additional system Ei. 

If we make conceptual |(/))j-measurements, we detect \(j)\ with probability p > 1/2 + 7. Thus, 
with probability at least 1 — e^^'"'' (Hoeffding's inequality jl5|). j > r/2 of the values tpi in 
registers Ai are in [c^y — A/5, ipv + A/5]. Consider this case. Because of the gap condition, other 
eigenphases are outside {(pv — 4A/5, (fy + 4A/5). Thus, there is a tp' with the property that more 
than half of the ipi are in [cp' — A/5, (p' + A/5], and we are guaranteed that ipv is within 2A/5 of 
ip' . Furthermore, any ipi within A/5 of ip' is associated with |0j) and therefore within A/5 of ipv 

We do not make the measurements of the previous paragraph. Instead we reversibly (and 

unitarily) determine whether an interval [cp' — A/5,ip' + A/5] containing more than half of the 

2 
iPi exists. Except for an error amplitude of e~^"' , the state satisfies the condition, which we now 

assume. We reversibly compute the number j and the median ip of the <pi in the interval found. 

As discussed in the previous paragraph, j > r/2 and ip is within A/5 of ipv- We place j in register 

B and ip in register A and reverse all classical reversible computations that were required since 

invoking the phase estimation oracles. Next we reorder systems so that for i < j, the i'th group 

is in state \4>)i\w'^)^,^, and for i > j, it is in the state IvXae- ~ ^'^'\'t''^)i- Here \w'i}^,^_ may be 

different from \wi)j^_^, because of possible correlations with (p. The states I'r])^^.^. have not changed 

because the eigenphases encoded in these states have no correlation with ip. 

For the last step of ER, we reverse the appropriate instance of PE' on each register \ti).j^_^^ . In 

order to do this we need to have computed the reordering permutation into an additional register, 

which we retain only if we need to reverse this instance of ER. The desired combination of states 

has now been obtained. D 



From the proof of Lemma IV. 7, it can be seen that the error is such that the r input systems' 
state remains in the tensor product of the span of {ip) and \(/)). 

To complete the transformation from |-i/;)®'~ to |(/>)®'' after applying ER without using an exces- 
sive number of reflections when p is close to 1, we can use the overlap-suppression trick. 
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Lemma IV. 8. Suppose that p > (4/3) (1/2 + 7) with 7 > given. Then there is a procedure 
Tp{U,V,A,j) that transforms |^)'^'' into a combination of states of the form \(j))^^\ip)^ up to an 
error amplitude of e~^^ , where ^—'-pv G [—A/5, A/5]. Tj, uses less than 2r instances o/PE(y, A/5) 
and an average of (n) < 2r reflections. We have (F") < F'^^ for 1 < F < 7/4. 

Proof. We begin Tp by adding ancilla qubits in state y^3/4|0) + y^l/4|l). Define \ijj) = 
|^)(y374|0) + ^/l/i\l)) and \4>) = \4>)\0). For the purposes of using ER, let Hq = 11 (g) jl/ll, 
where the second factor acts on the ancilla. We apply ER((y |0X0|) ©Hq, XIq, A, 7) and, provided 



j > r/2, we use r(^, (p) (defined in IV. 1 ) on the registers whose state is now indicated to be \(f> ). 
The number of instances of T applied is less than r/2. The bounds on the number of reflections 
used follow by Lemmas IV.2| and ILL D 



We need a method Tpx for transforming states as in Lemma IV. 8 that is well-behaved even for 
small p. In order for the method to work we require that the probabilities of eigenphases other 
than (fv in \ip) have a boundedness property. This will ensure that if {ip) has a big overlap with 
an eigenstate of V, then this eigenstate is \(p). 

Definition IV. 9. We say that ipy is a {'^,5) -dominant eigenphase 0/ y = Vq © XIq in |^) if for 
every 93 and associated projector H onto eigenspaces of Vq with eigenphases in I = [ip — 6,ip + 6], 
|n|^)P > 7 implies ipy G /. We refer to 5 as the resolution at which (pv is dominant and take FLq 
to he zero- dimensional if it is not specified. 

Tpx performs the transformation in the following steps. The first determines whether it is 
possible to confidently find a small interval for ^py without changing each of the r copies of \tjj) 
by much. We then measure each copy so as to project it onto |^) or |V' )• We ensure that the 
probability of recovering a large number of copies of \iIj) is high. If some IV'"'") are found or if we 
found that ipy can be determined sufficiently well, we use the recovered copies of IV") to learn a 
small interval containing ^pv and then transform the states. The first step is encapsulated by the 
next lemma. 

Lemma IV. 10. Suppose that ipy is a {pm — 3^,5) -dominant eigenphase of V = Vq (BHq in \^), 
where Hq is a projector that we can use to control other operations, HqI^P) = 0; Pm — 87 > 1/2 
and pm < 1. Let 5' = min((5/2, A/4). Then, using 2r instances of phase estimation oracles 
PE(y, (5'/2) and r reflections around {ip) , we can implement an isometry 'EJix(y,Ilo,A,6,pm.,'y) 
with the following property up to an error amplitude of 5e~^^ : ERj; transforms \ip)®''' into a 
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combination of states of the form 

|^)«.|^±)0(-.)|5)j^-)^ , (8) 

where if p > Pm, then j = r and b = 1; if p < Pm — 27, then j = r and b = 0; and otherwise 
j > r/20. 



Proof. We use the notation introduced in the proof of Lemma IV. 7 and apply r instances of 
PE'(y, (5Y2) to accommodate the special subspace associated with Ho. We look (reversibly) for the 
first interval // = [{I — 1)6', (/ + 1)5'], I £ {0,1, ... , \2-k/6'~\ — 1}, containing at least {pm — 'y)r of 
the phases in registers Ai. If no such interval exists, we set the state of A to |0)^, else we set it 
to |1)^. Any temporary storage required in the reversible classical computation of the content of 
A is erased. We then reverse the instances of PE' used and make |^)-measurements to determine 
which of the r input registers are in state |'0). Finally, we move the j registers in this state to the 
front and set the state of B to \j)g. 



As in the proof of Lemma IV. 7, after phase estimation, for some set S the state is a combination 
of products of \(p)i\wi)^,^_ for i G 5 and \r]\^,^, for i ^ S. There exists /q such that /jg D 
[ipv — S' /2, ipv + S' /2]. Suppose that we conceptually measure the registers Ai before the reversal 
of the phase estimation oracles. Let ki be the number of measured phases that are in //. Because 
26' < A, kif^ = \S\. In particular, the measured phases in principle determine the members of S. 
(We can use this for the analysis but not for the procedure.) We consider the three cases p > pm, 
P < Pm — 27 and Pm > P > Pm — 27. First, if p > pm, then, from Hoeffding's inequality applied 
to |5|, the probability that |5| > {pm — 'y)r is at least 1 — e~'^^'^ . Hence, with error amplitude at 

2 

most e"*"^ , there is a ki > {pm — l)r, and register A contains 1 before the reversals of the phase 
estimation oracles. The reversals successfully restore the initial state up to the given error. 

Consider next p < pm — 27. The probability that \S\ > {pm — "y)r is bounded by e"^'"'' . We 
show that the probability of finding a ki > {pm — "))r is small. For this purpose, consider the set H 
of / such that ipv I'l = [{I — ?>/2)6' , {I + 3/2)5']. For / G H, any measured phase in Ii is associated 
with an eigenphase in I'^ and therefore different from ipy. Because {2>/2)6' < 6, the dominance 
condition ensures that such eigenphases occur with probability at most pm — 37 in |^). To obtain a 
good bound on the mentioned probability, we consider the /cj's according to Vs location in a small 
partition of H. For any F Q H, let (p{F) be the set of eigenvalues of V in U{/[|/ € F} and P{F) 
the total probability of eigenphases in f{F) in {tp). We claim that we can in principle partition 
H = FiU ...U FiQ such that P{Fi) < Pm - 37. First note that Y.i ^(iO) < 4 because each 
eigenphase occurs in at most 4 of the I^'. We can construct the Fi greedily. Initialize i = 1 and set 
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Fi = 0. Then step through I £ H. If P{Fi U {I}) < p„i — 87, add I to Fi and proceed to the next 
/. If not, because p^ - 87 > 1/2, either P{Fi) > 1/4 or P{{1}) > 1/4. If P{Fi) > 1/4, initiaUze 
Fj_|_i = {I}, update i to i + 1 and proceed to the next I. If P{{1}) > 1/4, set Fj+i = Fj, reset Fj to 
{I}, update i to i + 1 and proceed to the next /. At the end of the procedure, P{Fii) > 1/4 for aU 
i' < i. As we observed above, for ah / G H, P{{1}) < Pm — 87, and the claim fohows. Let k{Fj) be 
the number of measured phases that are in IJieF ^i- Because these phases are due to eigenphases 
in ^{Fi), it follows from Hoeffding's inequality that k{Fi) > {pm — "y)r with probability at most 
g-8r7 ^ j£ ^ satisfies that ipv G I'l, then the measured phases in I; are associated with ipv- This is 
because (7/2)(5' < A. Probabilistic reasoning can be applied, so we conclude that the probability 

of kjasix ^ (Pm — ^)f is at most e~'^'^'^ + lQe~^'^'^ < 17e~^^''' . Thus, with an error amplitude 

2 
of at most 5e~'"^ , register A contains before the phase estimation reversals, and the reversals 

successfully restore the initial state. 

Finally, consider pm — 7 > p > Pm — 27. Because pm — 87 > 1/2, the probability that |5| > r/2 

is at least 1 — e"^'"'' . Therefore, except for an error amplitude of at most e~^'^ , A;max > p — 7 > 

Pm — 87 > 1/2 and /cmax = 15*1. We now assume this condition. After reversing the phase estimation 

oracles, the i'th system is in state \(f)) if i G 5 and 10-*") otherwise. Let j be the number of \ip) 

observed in the |'0)-measurements. The probability of detecting \il)) when the state is \(f)) or \(t) ) is 

p and 1 — p, respectively. The average value of j is p\S\ + {1 — p){r — \S\) > r/2, since p > 1/2 and 

15"! > r/2. By Hoeffding's inequality, for fixed S, the probability of the event E that j < (1 — x)r/2 

2 
is bounded by e~™ . To determine a bound on the overall probability P of E, we must use 

amplitude addition over different S. Thus vP < ^5 -v/^e~^^ '^. There are 2^' possible S, so 

the worst case sum of the y/Ps is 2^'"^. Therefore, P < e'^y^'^i'^)-^ ). "We set x = 9/10 and use 

ln(2) — (9/10)^ < —1/10 to see that the amplitude for having j < r/20 is bounded by e~^''^^. 

By amplitude addition, the overall error amplitude is bounded by e"*"^ + e"*"' ^^ < 2e~'^^ , since 

7 < 1/6. 

To complete the proof, it suffices to determine the maximum error amplitude. The maximum 

2 

error bound comes from the second case and is given by 5e~^"^ . D 



As we noted for the error amplitude in Lemma IV. 7, the error in Lemma IV. 10 is such that the 
r input systems' state remains in the tensor product of the span of \ip) and \(p). 

The last lemma of this section gives the properties of the parallel state transformation procedure 



Tpx that we outlined above. 



Lemma IV. 11. Assume that ipv is a (1— 47, 5)-dominant eigenphase ofV in {ip) with 1— 47 > 2/8. 
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Then there is a procedure Tp^ that transforms |^)®'' into a combination of\(f))'^^\(p\ and |V')'^''It^)a; 
where (/? — (pv G [—A/5, A/5], A's state is |#) if p < 1 — 87 and \(p) if p > 1 — 7, and the error 
amplitude is bounded by Ge"*"^ '^^. The procedure uses less than 4r instances of PE{V,6' /2), where 
6' = Tam{6/2, A/4), and an average number of reflections bounded by (n) < 5r. We have (F") < r^*" 
/or 1 < r < 7/4. 

Note that the procedure imphcitly provides overlap information. That is, if the transformation 
succeeds, the overlap satisfies p > 1 — 87. 



Proof. We use the overlap suppression trick and change each copy of \ip) to 1-0) = |'0)(\/3/4|O) + 
^/lJ4\l)) and define \4>) = |(/))|0). Let Hq = 1 |lXl|. We apply the procedure ER^(y ® |0X0| 
Ho, Ho, A, 5, 3(1 — 7) /4, 37/4) of the previous Lemma. If the first output register (system A in 
Eq. Q) contains 6 = 1 or if the second register (system B in Eq. ([s])) has j < r, we continue. 
Otherwise we set the return register to |#) and stop. 



To continue the procedure, we apply Tp(U, V, A, (3/4)7) (defined in IV. 8) to the first j registers 
(that now contain \^)), omitting the initial overlap-suppressing steps as they have already been 
done. The specification of ER^ and the assumption 1 — 47 > 2/3 ensures that the overlap is 
big enough to apply Tp, which returns ip. We then apply the appropriate instances of T (defined 
in IV. 1 ) to the remaining r — j registers to transform | ip ) into 1 0) . The refiections around | (f)) 
implicitly require (p. To finish we return the r registers and \^)^- 

The error amplitudes associated with the different steps must be added. From the application 
of ER^ we get 5e-'^(3/4)272 ^ which Tp adds at most e-(^/20){3/4)272^ 

The number of instances of phase estimation oracles used comes from the application of ER^; 
and Tp. The average number of refiections is bounded by the sum of r (from applying ER^;), 2j 



(from applying Tp), and at most 4(r — j) (from using instances of T as in Lemma IV.2). The 



reflections in Tp are from applications of at most j/2 instances of T. The total number of instances 



of T is bounded by r. To get the tail bounds, apply the bounds from Lemmas IV.2 and II. 1, with 



an additional offset of r for the first set of reflections. D 

V. STATE TRANSFORMATIONS ALONG A PATH 

We consider paths of eigenstates \^ps) of unitary operators Us with eigenphases ips and gaps 
As as defined in the introduction. We assume the ability to apply any Us and to prepare |V'o)- 
If the exact gaps are difficult to obtain, we take the A^ to be known lower bounds on the gaps. 
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We define ps^t = KV's|V't)P- The goal is to transform copies of the initial state \ipo) into the final 
state \ipi)- The transformations' complexities depend on what is known about the overlaps and 
the eigenphases along the path. They are designed to provide such information if it is not already 
known, so that future transformations can be performed more efficiently. 

Theorem V.l. Suppose that we know a subsequence = sq < . . • < s„ = 1 o/ [0, 1] such that 
Psk,Sk+i — 1/3; o,nd phases ipi satisfying (pi — ip^^ £ [—A/4, A/4]. We can then transform |^o) to 
\ipi) with m reflections R(f/si, c^j, A^.) where (m) < 4n and (T™) < T^" forl<T< 7/4. 

Proof. It suffices to apply Tm with the reflections instantiated by reflection oracles to advance from 



each state to the next. The complexities follow from Lemma IV. 2 and II. 1 D 



Given sufficiently large overlaps, the phases can be inferred to sufficient precision during a 
parallel state transformation. Note that the eigenphase (po of |^o) for Uq can be determined to 
within Ao/4 by one call to a phase estimation oracle with resolution A/4 and input state IV'o)- We 
therefore assume that a phase sufficiently close to ipo is known and reflections around j'i/'o) can be 
applied. 

Theorem V.2. Suppose that we know a subsequence = sq < . . . < Sn = ^ of [0,1] such that 
Psk,Sk+i ^ (4/3)(l/2 + 7) with 7 > 0. We can then transform iV'o)'^^ iiT'to |^i)'^'~ with an error 
amplitude ofne~^'^ . The transformation requires 2nr instances of phase estimation PF^lUs^, A^./S) 
and an average of (m) < 2nr reflections R(C/s. , c^i, AsJ. Furthermore (r™) < r^'" for 1 < F < 7/4. 
The transformation provides phases (pi satisfying (pi — ipg. E [—A/5, A/5] for i > 0. 

Proof. It suffices to apply Tp{Usi_j^,Usi,As.,^) n times to transform the states. The complexities 
follow from Lemma IIV.8I and III. 11 D 

The number of underlying calls to phase estimation oracles per copy of {tpi) produced by the 



procedure of Thm. V.2 is within a constant factor of that of Thm. V.l| (where phase estimation is 
used for the implementation of the reflections). The implementations of the oracles in the former 
case have an additional overhead to achieve the error goal, see Sect. |VI[ 

Theorems |V.l and |V.2| suggest that we can obtain state transformations along paths with 
complexities bounded by the path length. In particular, if the overlap probabilities Psi,Si+i are 
bounded above by cos(0)^, the path length is at least n6. Although it is possible for n to be much 
smaller than the path length due to shortcuts, generically we do not expect this. If the angular 
rate x{s) = ||(]1— |^sXV's|)|5sV's)|| along the path is defined and constant, x{s) = Xi then regularly 
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spaced Sj = 7r/(8x) ensure that the overlap conditions for the theorems above are satisfied and 
n = [SL/tt]. On the other hand, if many overlaps are close to 1, n could be large compared to L. 
We can eliminate this possibility if we have sufficient information about the overlaps, or after the 
first transformation by checking n overlaps during the transformation, as the next lemma shows. 

Lemma V.3. Let 9,9,9 > with 9<9<9 + 9< tt/2 be given. Consider a procedure for 
transforming copies oflipo) into copies o/|V'i) in n steps, where the steps transform from \ipti^i) to 
\ipti) with arccos(|(^i._J'(/'t-)|) < 9. Neither n nor the ti need to he deterministic, hut we assume that 
after |^t-) has been reached, information required to perform reflections and call overlap oracles 
for states |V't ) with j < i is available. We can then modify the procedure so that it outputs a 
sequence S* = {0 = sq < • • • < -5^ = 1} satisfying arccos(|('(/'si_i|V'si)l) ^9 + 9 for all i and 
9 < arccos(|(^s._-^|^s.)|) for i < k so that L(S) > 9{k — 1). To do so requires n calls to overlap 
oracles 0Y{\iJjti),\ipt'),i9 + 9)/2,{9 — 9)/2), n explicit reflections on \tpti) o-nd k invocations of 
T{\i^u), IV'i,)) wtth cos(9f <p< cos{9f. 

The procedures we describe satisfy that the information required to call reflection and overlap 
oracles is available when needed by the modification in the lemma. 

Proof. The Sj are elements of {tj}j. We begin by setting sq = 0. We ensure that at the end of 
the rth step (/ > 1) of the modified procedure, the last Sj that has been determined satisfies the 
invariant arccos(|(^s iV't;)!) ^ 9. To do so, let Sj be the last member of S that has been determined 
before the Tth step. If / = n, set Sj+i = 1. Else, after the transformation into copies of \ipti) has 
been accomplished, call the overlap oracle 0V(|^(;), \ips), (9 + 9)/2, (9 — 9)/2) on the first copy of 
\ipti)- Then use a reflection around \ipti) to determine whether the state was preserved. If not, or if 
the overlap oracle returned 1, set Sj+i = ti. If the state was not preserved, then call T{\tps.), \ipti)) 
to restore it. This completes the modification of the Tth step and ensures the desired properties 
for the Sj determined so far and the invariant. The lower bound on the length follows by adding 
up the lower bounds on the angular distances between successive \ipsi) for i < k. D 

When the overlaps ps^t or the angular rates x(-s) are unknown, the state transformation requires a 
recursive procedure to find a sequence of successive states with sufficiently high overlap. We assume 
that such states can be found, more specifically, we require that there are no jumps of angular 
distance equal to some given constant or greater. Our recursive state transformations involve 
binary subdivision of intervals. To transform the state from \tpa) to IV'fe)) we check whether we can 
do it directly at a cost of C{a, h). If not, we recursively transform from jf/^a) to \ipia+b)/2) ^-i^d then 
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from |'0(a+())/2) to \ipb)- We are interested in the total cost of the transformation. For our purposes, 
the cost is the number of times the unitaries Ug are used. This is determined by the number 
of times phase-estimation is used either directly or indirectly when applying reflections. The 



resolution required is typically the gap, and the cost is related to the inverse gap (Sect. Ill), which 
can depend on the position along the path. To enable taking this into account we provide general 
tools for analyzing the complexity of recursive path transformations based on binary subdivision 
in Appendix [B] 

We define the symmetric binary interval tree on [a, 6], BIT(a,6), as the set of intervals con- 
structed by starting with T = Tq = {[a, 6]} and recursively adjoining [c, [c + d)/2\ and [{c + d)/2^d] 
to T for every [c, d] in T. We also define the cost of BIT(a, h) as 

C(BIT(a,6))= Y^ C{c,d) . (9) 

[c,d]eBIT(a,b) 

With the appropriate choice of the cost function C, this is the cost of a recursive state transforma- 
tion procedure, and we show that it depends linearly on length and at worst logarithmically on the 
ratio of maximum to average angular rates. For simplicity, we use this estimate to state complexity 
bounds for state transformations where relevant, with the understanding that local-cost-sensitive 
estimates can be obtained if needed. 

Let Cmax = SUp{C(si, S2) \ a < Sl < S2 < b}, Umax = SUp{(L(s2) - L(si))/(s2 - Si) | « < Si < 

S2 < b, L{s2) - L{si) > 9} and v^^yg = {L{b) - L{a))/{b - a). If L < 9, define Umax = Vavg- 
Lemma V.4. IfC{c,d) = for L{d) - L{c) < 9, then 

C;(BIT(a, b)) < log2 h 3 Cmax • (10) 



The proof is given in Appendix [B] 

Let Umax and Uavg be as defined in Lemma V.4, where by default a = 0, b = 1 and 9 is clear 



from context if not specified. Note that finiteness of Umax requires that the path has no jumps of 
angular distance 9 or more. 

Theorem V.5. Suppose that ips is a [1 — ^'y, 5) -dominant eigenphase ofUg in {tpr) for all r < s, 
where 1 — 47 > 2/3. Let 6' = min(5/2, A/4), 9 < arccos(^/l — 7) and C = 2L(log2(umax/i'avg) + 
3)/^+l. Then we can transform |'0o)'^'^ ii^to iV'i)'^'" with an error amplitude bounded by 6Ce~^^ '^^ 
with at most ArC instances of phase estimation oracles with precision at least 5' , an average number 
of reflections bounded by (n) < 5rC and (F") < F'^'"'^ for 1 < F < 7/4. 
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Proof. To implement the transformation, we apply Tp^ of Lemma IV. 11 recm'sively to the intervals 
of the BIT, terminating at intervals where the transformation succeeds. The lemma guarantees 
that the transformation succeeds if the angular length of the interval being tried is less than 
arccos(-v/l — 7). 

To determine the complexity of the transformation, we need to consider a modified tree cost. 
Let Ce{a,b) = C{a, {a + 6)/2) + C{{a + b)/2,b) if L{b) - L{a) > 6 and Ce{a,b) = otherwise. 
Define 

C'e(BIT)(a, b) = C{a, b) + Ce(BIT)(a, b) . (11) 

This accounts for the fact that the shortest intervals in the tree that require action can be associated 
with arbitrarily short angular lengths. It is their parents whose angular length must be too long 
for terminating the recursion. With C{a,b) = 1, Cg(BIT){0, 1) is an upper bound on the number 



of intervals for which transformation is attempted. According to Lemma V.4 , this is bounded by 



IV. 11 



C = 2L(log2(fmax/^avg)+3)/0+l. The rest follows by multiplying the complexities in Lemma 

by C and applying Lemma |II.1[ For the error amplitude we used amplitude addition. D 

The next theorem can be applied when little information on overlaps is available, but we know 
sufficient eigenphase ranges for performing the necessary reflections. For this we need to consider 
the case where the transformation has probability of success ps < 1 when L(b) — L{a) < 6. In this 
case, the process of subdividing [a, b] may continue indefinitely. For ps > 1/2, the expected number 
of intervals considered is finite with an exponentially decreasing tail probability. This follows from 
the theory of Galton- Watson processes, but in Appendix [C] we give a statement and proof sufficient 
for our purposes. 



Theorem V.6. Let 9 = arccos(y'l/3) ~ 0.96. Suppose that we know phases Cps satisfying ips — (ps £ 
[—A/4, A/4]. We can transform |^o) ii^'to |^i) using (n) < n = 40L(log2(fmax/i'avg) + 3)/^ + 10 
reflections, with (F") < F^^" /or 1 < F < 14/13. 

Proof. To implement the transformation, we apply T^^ to the intervals of the BIT recursively. 



(T^^ is defined after Cor. IV. 6 ) The recursion terminates when a transformation succeeds. For 



the intervals of angular length greater than 6, Cor. IV. 6 characterizes the distribution of the number 



of reflections used whether or not the transformation succeeds. For intervals / of angular length 



at most 9, the number of subintervals that need to be tried is characterized by Lemma C.l, where 
the success probability satisfies ps > 19/20. Whether an interval needs to be tried depends only 
on whether any of the intervals containing it (that is, above it in the BIT) succeeded. Because 
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of Cor. IV. 5, Lemma II. 2 can be applied. Thus, according to Cor. |IV.6[ the total number nj of 



reflections used in our transforming across / satisfies {nj) < 9ps/{2ps — 1) < 10 and (F"^) < 
pii(H-2/{2p,-i)) < p36 for 1 < r < 14/13 < mm{{l/{2y/psil - PsW/^\8/7}. To finish the proof, 
as was noted in the proof of Thm. |V.5[ the number of intervals of the BIT whose parents have 
angular length greater than 6 is bounded by C = 4L(log2(i'max/wavg) + 3)/^ + 1. These intervals 
form a subtree BITg. Every reflection can be associated with the smallest interval in BITg for 
whose traversal it was used. The statistics of the number of reflections associated with each such 
node are bounded by the ones we obtained for intervals of angular length at most 0. Thus, we can 



apply Lemma II. 1 to complete the proof. D 



VI. SUMMARY OF COMPLEXITIES 

The most salient complexities are summarized in Table |Ij The bounds apply uniformly for 
L = max(7r/2,L), < A < vr, and < e < 1/2. We also define C = L (log (umax/'Wavg) + 2). In 
order to obtain the bounds, it is necessary to take into account the error amplitudes contributed 
by two sources and make sure they do not exceed the error goal of the algorithm. The first is 
in the implementation of phase estimation and reflection oracles, and the second in our multi- 
copy transformations. Calls to either oracle in our algorithms require O (log{l / 6) / A) uses of the 
underlying unitary operator for error amplitude 6. If the state transformation requires M phase 
estimations or reflections, then we can set 5 = e/(2M) to ensure that the total error is bounded 
by e/2, since error amplitudes are sub-additive. Thus the complexity in terms of uses of the 
relevant unitary operators is O {M \og{M / e) / A) . The additional error in our multi-copy state 
transfer algorithm is bounded by e^^'''' per state transformation attempt and is given explicitly in 
Theorems |V.2| and |V. 5 In our algorithms, the number of phase estimation and reflection oracle calls 



per copy is linearly related to the number n of state transformation attempts. The latter determines 
the total error from this contribution, which is ne~^^'''K Thus, for an error goal of e/2 we can set 
r = G(log(n/e)). This requires C'(nlog(n/e)) total phase estimation and reflection oracle calls. 
After accounting for the error in the implementation of these oracle calls, we obtain a complexity 
per copy of C'(nlog(nlog(n/e)/e)/A) = C'(n(log(n/e) + loglog(n/e))/A) = C'(nlog(n/e)A). 

The formal meaning of the columns in Table|I]for assumed knowledge can be determined from the 
statements of the referenced lemmas and theorems. Having knowledge of overlap approximations 
means knowing enough about the Ps^t to be able to pick = sq < . . • < s„ = 1 such that Psj^i,sj is 
large but bounded away from 1. This ensures that transforming along the Sj is possible and efficient 
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Knowledge assumed 




Overlap 
approx- 
imations? 


Overlap 

lower 
bounds? 


Eigenphase 
ranges? 


Eigenphase 
dominance? 


Cost per copy 


Number of copies 
required 


Reference 


%/ 




V 




Ki-©) 


1 


Thm. V.l 
Lemma V.3 




V 






-d'-©) 


e (,„,(=)) 


Thm. 


V.2 






V 




KK-©) 


1 


Thm. 


V.6 








V 


KK-©) 


e(,.(^)) 


Thm. 


V.5 



TABLE L Path transformation complexities. The entry in the column "number of copies required" gives 
the minimum needed by the referred-to algorithm to achieve the desired error amplitude. In this case, the 
error amplitude applies to all copies simultaneously, so unless there are strong error correlations, individual 
copies may have substantially less error. We have not determined the extent to which the transformations 
of the copies can be parallelized. If eigenphase dominance applies, we assume A is also a lower bound 
on the resolution for dominance. The maximum angular velocity Wmax can be bounded with 9 — il(l) in 
Lemma |V.4[ The results in Appendix [D] may also be helpful, n is determined by = sq < . . . < s„ = 1 
where psi.sj+i > 1/2 + f^(l)- We have L — 0{n), but n could be substantially larger than L if wc do not use 
preprocessing as in Lemma |V.3[ 



in terms of path length. Knowing overlap lower bounds ensures the former only. When we say that 
eigenphase ranges are known, we mean that for all s, we know an interval (or more generally, a set) 
containing ips such that the distance from this interval to every other eigenphase of Us is at least 
A. This is sufficient for implementing the reflections with low error. The eigenphase dominance 
condition ensures that we can statistically distinguish the wanted eigenphase when using multiple 
copies of the states to infer adequate eigenphase ranges. The formal definition for a path is in 
Thm. |V3] based on Def. Wl^ 

Lemma V.3| can be used to preprocess the transformation steps so that the complexity of the 
ffist row of Table ffl applies to subsequent transformations. Use of Lemma V.3 requires 0{n) calls to 
overlap oracles, where n is the number of actual transformation steps used. Thus, the complexity 



has an additive term of C'(nlog(n/e) ), according to the note after Def. IIL4 However, if the 



recursive subdivision technique is used as in the last two rows of the table, the use of overlap 
oracles can be avoided. 

We have given the key complexities in terms of global quantities that are simple to state. The 
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complexities actually depend on local aspects of the path. In particular, if the gaps A^ are typically 
large compared to the minimum, sections of the path can be traversed much more quickly. This 
can be taken into account by a finer complexity analysis, for example by taking advantage of 
Lemma IB.2I 

Our analyses apply to paths of non-degenerate eigenstates, but much of it can be extended to 
paths of eigenspaces as follows. Suppose that the path is characterized by a family of spaces Zg 
consisting of a union of eigenspaces of Ug whose set of eigenphases have a minimum distance A^ to 
all eigenphases of states orthogonal to Z^. The multi-copy transformation algorithms used when 
we have insufficient information about the eigenphases require that Z^ is an eigenspace. Reflections 



around Zg can be implemented with the functional calculus of Zg as noted after Def. III. 3 The 
goal is to transform an arbitrary initial state |^o) £ -^0 into some iV'i) G Z\. To generalize our 
analysis, it is necessary to redefine the path length. Let lis be the projector onto Zg. We define 
L([a,6]) = supL(a = sq < . . • < s„ = 6), where L(so < ... < s„) = I^?=o ^('^i' ^i+i) ^^"^ 
0{s,t) = max{arccos(|nt^s|) | ipg G Zg}. Note that having no large jumps in the path implies that 
the dimension of Zg is non-decreasing. The basic transformation steps are the same, but their 
analysis requires the observation that the reflections around the subspaces Zg and Zt are a direct 
sum of reflections on two-dimensional subspaces of the space spanned by Zg and Zt, see Ref. [26]. 
Within each such subspace, the transformation behaves as expected. The relevant overlaps now 
depend on the relationship between the reflection axes in the mentioned two-dimensional subspaces. 
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Appendix A: Proof of Lemma II. 2 



Let ^ be the measure for the probabihty distribution of its arguments. The conditional indepen- 
dence assumption is equivalent to having the Cj and Wj generated via the sequence of probabilistic 
transitions . . . — t- (Cj,Wj) — t- (Wj) — t- (Cj+i, Wj+i) —)■.... It follows that Cj is conditionally 
independent of the Ck for k < j given Wj . Formally we have 

= dfi{Ck, Wk\Wk_^)d^l{Ck-l\Wk-l)d^^(Wk-l) 

= dfx{Ck,Wk\Wk^i)dfi{Ck-i,Wk-i) 

= dfi{Ck\Wk)dfi{Wk\Wk-i)dfi{Ck-uWk-i) 

' k ^ 

\{d^{C,\Wj)d^{W,\W,^i) 

J]d/i(C7,|W,)|d/i(Wfc), 

where the omitted identities involve applying the first steps recursively to the last term. 
Given the constraints on F, we obtain 

ny"r^^dMc,iWj)jd/x(Wk) 

n I (i - y, + V, j T^^d^,{c,\w^, v,)^ d,,{v,\W;) J d^,{w^) 

^ / ( n /(I - ^j- + ^.r^)rfM^ilWj) I d^(Wk) 
ny"r^^^(iM^,|Wj)jdMWk) 

ny"r^^^(i/i(c,iWj)jci/i(Wk) 

Jr^^Uy^dfi{Ck), 
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because Vj is a function of Cj. To get the desired bound, we let k — t- oo, use the monotone 
convergence theorem, and apply the bound on (A™") with A = F : 



lim fr^^'S-^^idniCk) = f lim r^^'^=^^^dfi{Ck 

fc— >oo J J fe— >oo 

= (r'^™) < A*^ . 



Appendix B: Recursive transformations complexity 

Let L{s) be the length of the path \tpt) from t = to t = s. Define s{l) = inf{s : L{s) > l,s £ 
[0, 1]} and s{l) = sup{s : L{s) < l,s £ [0, 1]}. We have s{l) < s{l) and the state is constant on 
the open interval (s(/),s(/)). The functions s and s are monotone. Given / € [L(a),L{b)] and a 
distance scale 6, we define a local maximum speed variation at / by 



ag{l, [a,b]) = sup||5|^ I a < si < S2 < S3 < S4 < 6, (Bl) 

L(S2) < / < L(S3), 

L{S3) - L{s2) > e, 
L(s4)-L(si) < 2^1 . 

If the set under the supremum is empty, let o"6i(/, [a, b]) = 1. To justify the description of ae as a 
speed variation, we define average speeds between /i and I2 > h by v{li, I2) = ih — h)/i§.ih) — sih)) 
and v{li, I2) = {h — h)/is{l2) — s(/i)), allowing for constant sections on the path. A more direct 
local speed variation is 

pe{l,[a,b])= sup [^^^ \ L{a) < h < I2 < l< I3 < U, (B2) 

^3 = ^2 + e, 

k = mm{h+2e,L{b))\ . 

If the set under the supremum is empty, let pe(Z, [a,b]) = 1. The description of a0{l, [a,b]) as a 
speed variation at / at scale 9 comes from the observation that 

Lemma B.l. For all intervals I 

ae{l,I)<2pe{l,I) (B3) 

with equality if I G [L{a) + 26,L{b) — 20], the path is continuous and has no constant intervals. 

Proof. The set X in the definition of ctq{1, I) is empty iff L(6) — L{a) < 9 or I ^ [-^(^)) L(b)]. If the 
latter holds, then the set Y in the definition of P0{1, 1) is empty. If not, then either Y is empty or 
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L{b) = L{a) + 6. In this case Y = {y} with y > I. Consider r = (54 — si)/(s3 — S2) G X, with Sj 



as in Eq. Bl Let li = L{si) and define I'l = h, l'^ = niin(/i + 26,L{b)) > /4 and choose I2, l'^ such 
that /^ = V^ + e,l2<l'2<l<l'i< h- Then 

-s{Q-s{l'^) 



r < 



S{1'3)-S{1'2) 



< 2<'^3-^2^^(^4)-^W) 



j4-^w ^(^3)-^a2) 

For the reverse inequality, given e > arbitrarily small, choose li such that r' = v{l2, h)/ll{h, h) > 
pg{l,I) — e. The constraint on / implies that U = h + 29, so r' = 2{s{U) — s{li)) / {3(13) — s(/2))- 
Let 5 > be arbitrarily small. Let s, = s(/j) for i = 2,4 and Sj = s{lj) for j = 1,3. Continuity 
implies that L{si) = li. If I2 > h, then S2 > si and we let 82 = 82 — S. Else ^3 < ^4 and we let 
S3 = S3 + 6. Unassigned s'- are set to Sj. The assumptions imply that for 6 small enough, the 



s^ satisfy the constraints in Eq. Bl, showing that ag{l,I) > 2pg{l,I) — e. Letting e J, gives the 



desired result. D 

The next lemma gives a bound on the cost of a symmetric binary interval tree that is sensitive 
to local variations. 

Lemma B.2. IfC{c,d) = for L{d) - L{c) < 9, then 

1 fL{b] 

k=o~ •'-^(") 



C(BIT(a,6)) < -^- / [[log2{a2.eil,hb]))\ +l)C2.eil)dl , 

;._n J L{a) 

(B4) 



where Ce'{l) = sup{C(si, sa) | L(si) < I < 1(82), 9' < L(s2) - L(si) < 29'}. 

Proof. The bound is obtained in three steps. First we separately sum over intervals [si,S2] in the 
tree in each length class 2 9 < ^(sa) — -^(-si) < 2 ~^^6. Second, we uniformly, randomly assign the 
cost C(si, S2) to the open interval between L{8i) and L{82) and integrate over the length variable. 
Third, we use bounds on costs and numbers of intervals in the tree spanning the value of the length 



32 



variable. The steps are implemented in the following sequence of identities and inequalities: 
C(BIT(a,6)) = Yl Cisi,S2) 

[si,S2]eBIT(a,fe) 

oo 

fc=0 [si,S2]eBlT(a,b), 

2'=e<L(s2)-L(si)<2'=+le 



EE 



C{si,S2] 



k=0 [si,S2]eBIT(a,6), 

2'=e<L(s2)-L(si)<2'=+le 

oo 

^ EE 

fc=0 [si,S2]eBIT(a,6), 

2'=e<L(s2)-L(si)<2'=+le 



L(s2) - L(si) yi(^^) 



i(«2) 



dZ 



1 /■i(6) 

C{S1,S2)^J l[L(,^),L(,,)](/)d/ 



< 



I^^q'^''^ JLia) \ [si,S2]6BIT(a,fe), 



l[L(si),L{s2)](0 ^^2*5(0^^ 



2'=6l<L(s2)--L(si)<2'=+ie 

To finish the proof, note that the number in parenthesis in the last line, 

{[si,S2] G BIT(a,6) I L{si) < I < L{s2),2^e < L{s2) - L{si) < 2''+^e^ 

is the size of a set of nested intervals in the tree. Let [si, S4] be the biggest and [s2, S3] the smallest 
of these intervals. Because of the way the tree is constructed, the number of these intervals is given 
by 



log2((s4 - Sl)/(S3 - •S2)) + 1 < Cr2''e + 1 



where the inequality follows from the definition of a. 



2'= 61 



D 



Finally, we give the proof of Lemma V.4 stated in the text 



Proof of Lemma V.4 We begin by bounding fj^y{log2{o'ei{l,[a,b]))dl for 6' > 6 in terms of the 



lL{a) 



maximum and average angular rates. If L{b) — L{a) < 9' , then ag'{l, [a, b]) = 1 so the integral is 0. 
Assume L(6) — L{a) > 6' and let e > be arbitrarily small. For every I, choose a < si{l) < S2{1) < 
■83(0 — 84(1) < b such that the following hold: The Si = Si{l) satisfy the constraints given in the 



definition of ag'{l, [a,b]) (see Eq. Bl), (Tg>{l, [a,b]) < 2^(s4(/) — ■si(/))/(s3(/) — si(/)), and the Si{l) 



are measurable functions. We can now bound 

"^(^) /-^(^^ sa(I)-sUI) 
log2{ae'{l,[a,b]))dl < {L{b) - L{a))e + / log2 ^7^ %(d/ • 

L(a) JL{a) Ss{l) - S2{1) 
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Jensen's inequality, applied to the concave log2 function, gives 

L(a) S^{l)-S2(l) L{b) - L[a) J L^a) S3{1) - S2[l) 

We bound the denominator inside the integral with the inequality (s^^l) — S2{1)) > ^'/"^max; which 
does not depend on the variable of integration, so we can continue by bounding the integral of the 
numerator. We first change the order of integration 

Lib) Mb) i-si(l) .b .Lib) 

(s^il) - si{l)) dl = / dl dt= dt l|i|ig[^^(,)^^4(i)]}(/)(i/ . 

L{a) JL{a) J si{l) J a J L{a) 

If t G [si(/), S4(/)], then L{1) < L(s4(0) < L{si{l)) + 29' < L{t) + 26'. Similarly, L{1) > L{si{l)) > 
L{s4{l)) — 26' > L(t) — 26' . It follows that the inner integral is bounded by 4:6' . This gives the 
bound 

1 f^^^K /,N .,XN „ 1 r\.fr 4:6' (b- a) 46' 

' {s4{l) - si{l))dl < —— — - / 46'dt- ^ ' - 



L{h) - L{a) Ji^a) L{b) - L{a) J^ L{b) - L{a) Uavg 

Combining the bounds and letting e go to 0, we obtain 

<-L{b) / 

log2(ae'(/, [a, 6])) < {L{b) - L{a)) logs ^^ + 2 

L{a) V ^avg 



Substituting into the bound of Lemma B.2 we get 



1 .°°. 1 / \ 

C(BIT(a,6)) < ^ (L(6)-L(a)) log2^^ + 3 W7„ 

^ k=0^ V ^avg J 

2{L{b) - L{a)) ( i;^ax ^ „ ^ 

< 7, log2 \- 3 Cmax , 



as desired. D 

Appendix C: A Galton- Watson lemma 

Lemma C.l. Consider the random process starting from S = {([a, 6], active)} which is defined 
recursively as follows: For all N = ([c, d], active) in S, replace N with with ( [c, d] , inactive) and 
with probability 1—ps arfd ([c, (c+d)/2], active) and ([(c+(i)/2, d], active) toS. Let Soo be the possibly 
infinite set obtained by running this process countably many times. For ps > 1/2, the expected size 
ofS^ is (|5oo|) =Ps/{2ps - 1) and (rl^-l) < T^+Vi^Ps-i) forl<T< 1/(2^^^(1 - P.))- 

Proof. We outline the proof, omitting some necessary existence arguments. Either ^oo = 
{([a, b], inactive)}, which happens with probability's, or ^oo = {{[a, b], inactive) } U 5"^ U 5,. , where Si 
and Sr are independent with the same statistics as 5oo- The latter happens with probability 1—ps- 



34 

Thus (|S'oo|) = Ps + 2{1— ps){\Soo\), or (jS'ool) = Ps/i^ps — 1). Similarly, using the independence of 
Si and Sr again, we find (rl"^""!) = r (ps + (1 - ps)(rl'^°°l)^). This is solved by 

<^""'> = 2(CT (' ^ ^'-"■^''''-f'-^')) ■ 

The relevant solution is the negative branch, as can be seen by checking that (fl"^""!) = 1. The 
maximum value of F for which it is defined is F = l/{2y^ps{l — Ps)). For this F, (fI"^""!) = 2psT. 
The following sequence of inequalities together with log-convexity completes the proof: 

2ps = l + (2p,-l) 
2ps-l 



< 1 + 
= 1 + 

£^1 + 



4ps{l -ps) 

1 \ / (2p, - 1)2 



2ps-lJ \Aps{l-ps) 

{2ps - 1) 2 \ sj^ 
Aps{l -p. 



s) 

1 



1 \ 2ps-l 2 



^4ps(l -ps), 

at the maximum F. D 

Appendix D: Angular rate estimation 

Here are some tools for estimating average angular rates for paths of eigenstates of normal 
operators. 

Lemma D.l. Let H he a normal operator with eigenstate \ip) having eigenvalue A. Suppose H + S 
is normal and has an eigenspace V with eigenvalue X + 6 gapped by A. Then the maximum angle 
from \tp) to V is bounded by arcsindKS* — (5)1^^)11 /A). 

Proof. Let Ft be the projector onto the orthogonal complement of y. It suffices to show that 
|(^'|n|^)| < IKS' — 5)|^)||/A for all \tp')- We can use the "resolvent trick" to express 

n = ^Jl-{z-{{H + S)-{X + 6)))-'dz 

= 7^ f(.z- {{H + S)-{X + 5)))-\{H + 5) - (A + 5))dz/z , 
2m I 



where the integral is over a circle of radius d less than A around 0. Thus 

KV^'lnlV')! = {i^'\^j{z-{{H + S)-{X + 5))r\S-5)dz/z\i 
< \\{S - 5)m\{^ - d)-' . 
The result follows by letting d go to 0. D 
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Corollary D.2. Let H(s) be a family of normal operators, H(s)\ip{s)) = \{s)\ip{s)) with all objects 
difjerentiable at s = t. Let IIj = II — |'0(t))('0(t)|. Lf the eigenvalues X{s) are gapped with gap A in 
a neighborhood of s = t, then 



n; 



d\4,{s)) 



ds 



s=t 



< 



dH{s) 



ds 



s=t 



/A. 



(Dl) 



Proof. It suffices to apply first-order perturbation theory to Lemma D.l , noting that H{s) = H{t) + 



{t-s){dH{s)/ds\s=t) + o{\t-s\) andX{s) = X{t) + {t-s){i;it)\{dH{s)/ds\,=t)\ij{t)) + o{\t-s\). D 
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